Off-Policy Evaluation in Partially Observable Environments
نویسندگان
چکیده
منابع مشابه
Probabilistic Robot Navigation in Partially Observable Environments
Autonomous mobile robots need very reliable navigation capabilities in order to operate unattended for long periods of time. This paper reports on first results of a research program that uses partially observable Markov models to robustly track a robot’s location in office environments and to direct its goal-oriented actions. The approach explicitly maintains a probability distribution over th...
متن کاملRisk-Sensitive Planning in Partially Observable Environments
Partially Observable Markov Decision Process (POMDP) is a popular framework for planning under uncertainty in partially observable domains. Yet, the POMDP model is riskneutral in that it assumes that the agent is maximizing the expected reward of its actions. In contrast, in domains like financial planning, it is often required that the agent decisions are risk-sensitive (maximize the utility o...
متن کاملPrivacy Preserving Plans in Partially Observable Environments
Big brother is watching but his eyesight is not all that great, since he only has partial observability of the environment. In such a setting agents may be able to preserve their privacy by hiding their true goal, following paths that may lead to multiple goals. In this work we present a framework that supports the offline analysis of goal recognition settings with non-deterministic system sens...
متن کاملInverse Reinforcement Learning in Partially Observable Environments
Inverse reinforcement learning (IRL) is the problem of recovering the underlying reward function from the behaviour of an expert. Most of the existing algorithms for IRL assume that the expert’s environment is modeled as a Markov decision process (MDP), although they should be able to handle partially observable settings in order to widen the applicability to more realistic scenarios. In this p...
متن کاملProbabilistic Navigation in Partially Observable Environments
Autonomous mobile robots need very reliable navigation capabilities in order to operate unattended for long periods of time. We have developed an approach that uses partially observable Markov models to robustly track a robot’s location and integrates it with a planning and execution monitoring approach that uses this information to control the robot’s actions. The approach explicitly maintains...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Proceedings of the AAAI Conference on Artificial Intelligence
سال: 2020
ISSN: 2374-3468,2159-5399
DOI: 10.1609/aaai.v34i06.6590